Natural Language Access To Structured Text
نویسندگان
چکیده
This paper d i s c u s s e s the problem of p r o v i d i n g n a t u r a l l anguage access to textual material. We are developing a system that r e l a t e s a r e q u e s t i n E n g l i s h to s p e c i f i c p a s s a g e s i n a document on the b a s i s of co r respondences between the l o g i c a l r e p r e s e n t a t i o n s of the i n f o r m a t i o n i n the r e q u e s t and i n the p a s s a g e s . In a d d i t i o n , we a r e d e v e l o p i n g p rocedures f o r a u t o m a t i c a l l y g e n e r a t i n g l o g i c a l r e p r e s e n t a t i o n s of t e x t p a s s a g e s , d i r e c t l y from the t e x t , by means of an a n a l y s i s of t he coherence s t r u c t u r e of the p a s s a g e s .
منابع مشابه
Presenting a method for extracting structured domain-dependent information from Farsi Web pages
Extracting structured information about entities from web texts is an important task in web mining, natural language processing, and information extraction. Information extraction is useful in many applications including search engines, question-answering systems, recommender systems, machine translation, etc. An information extraction system aims to identify the entities from the text and extr...
متن کاملA Structured Approach for Building Assamese Corpus: Insights, Applications and Challenges
To study about various naturally occurring phenomenons on natural language text, a well structured text corpus is very much essential. The quality and structure of a corpus can directly influence on performance of various Natural Language Processing applications. Assamese is one of the major Indian languages used by the people of north east India. Language technology development works in Assame...
متن کاملText is Software Too
Software compiles and therefore is characterized by a parseable grammar. Natural language text rarely conforms to prescriptive grammars and therefore is much harder to parse. Mining parseable structures is easier than mining less structured entities. Therefore, most work on mining repositories focuses on software, not natural language text. Here, we report experiments with mining natural langua...
متن کاملA New Method for Improving Computational Cost of Open Information Extraction Systems Using Log-Linear Model
Information extraction (IE) is a process of automatically providing a structured representation from an unstructured or semi-structured text. It is a long-standing challenge in natural language processing (NLP) which has been intensified by the increased volume of information and heterogeneity, and non-structured form of it. One of the core information extraction tasks is relation extraction wh...
متن کاملTowards Design and Implementation of a Language Technology based Information Processor for PDM Systems
Product Data Management (PDM) aims to provide ‘Systems’ contributing in industries by electronically maintaining organizational data, improving data repository system, facilitating with easy access to CAD and providing additional information engineering and management modules to access, store, integrate, secure, recover and manage information. Targeting one of the unresolved issues i.e., provis...
متن کاملStructured Querying of Web Text A Technical Challenge
The Web contains a huge amount of text that is currently beyond the reach of structured access tools. This unstructured data often contains a substantial amount of implicit structure, much of which can be captured using information extraction (IE) algorithms. By combining an IE system with an appropriate data model and query language, we could enable structured access to all of the Web’s unstru...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 1982